The Effect of the Guide Tree on Multiple Sequence Alignments and Subsequent Phylogenetic Analysis

نویسندگان

  • Serita M. Nelesen
  • Kevin Liu
  • D. Zhao
  • C. Randal Linder
  • Tandy J. Warnow
چکیده

Many multiple sequence alignment methods (MSAs) use guide trees in conjunction with a progressive alignment technique to generate a multiple sequence alignment but use differing techniques to produce the guide tree and to perform the progressive alignment. In this paper we explore the consequences of changing the guide tree used for the alignment routine. We evaluate four leading MSA methods (ProbCons, MAFFT, Muscle, and ClustalW) as well as a new MSA method (FTA, for “Fixed Tree Alignment”) which we have developed, on a wide range of simulated datasets. Although improvements in alignment accuracy can be obtained by providing better guide trees, in general there is little effect on the “accuracy” (measured using the SP-score) of the alignment by improving the guide tree. However, RAxML-based phylogenetic analyses of alignments based upon better guide trees tend to be much more accurate. This impact is particularly significant for ProbCons, one of the best MSA methods currently available, and our method, FTA. Finally, for very good guide trees, phylogenies based upon FTA alignments are more accurate than phylogenies based upon ProbCons alignments, suggesting that further improvements in phylogenetic accuracy may be obtained through algorithms of this type.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The effect of the guide tree on multiple sequence alignments and subsequent phylogenetic analyses.

Many multiple sequence alignment methods (MSAs) use guide trees in conjunction with a progressive alignment technique to generate a multiple sequence alignment but use differing techniques to produce the guide tree and to perform the progressive alignment. In this paper we explore the consequences of changing the guide tree used for the alignment routine. We evaluate four leading MSA methods (P...

متن کامل

Co-estimation of Phylogeny-aware Alignment and Phylogenetic Tree

The phylogeny-aware alignment algorithm implemented in both PRANK and PAGAN has been found to produce highly accurate alignments for comparative sequence analysis. However, the algorithm’s reliance on a guide tree during the alignment process can bias the resulting alignment rendering it unsuitable for phylogenetic inference. To overcome these issues, we have developed a new tool, Canopy, for p...

متن کامل

Measuring guide-tree dependency of inferred gaps in progressive aligners

MOTIVATION Multiple sequence alignments are generally reconstructed using a progressive approach that follows a guide-tree. During this process, gaps are introduced at a cost to maximize residue pairing, but it is unclear whether inferred gaps reflect actual past events of sequence insertions or deletions. It has been found that patterns of inferred gaps in alignments contain information toward...

متن کامل

Using guide trees to construct multiple-sequence evolutionary HMMs

MOTIVATION Score-based progressive alignment algorithms do dynamic programming on successive branches of a guide tree. The analogous probabilistic construct is an Evolutionary HMM. This is a multiple-sequence hidden Markov model (HMM) made by combining transducers (conditionally normalised Pair HMMs) on the branches of a phylogenetic tree. METHODS We present general algorithms for constructin...

متن کامل

Phylogenetic characterization of the fusion genes of the Newcastle disease viruses isolated in Fars province poultry farms during 2009-2011

Despite routine vaccination programs against Newcastle disease (ND), sporadic cases have occasionally occurred that remain a constant threat to commercial poultry. Ten isolates of Newcastle disease viruses (NDV) from infected broiler chicken cases were obtained from various locations in Fars province during 2009-2011 and genetically analyzed using reverse transcription polymerase chain reaction...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008